NSF PAR Search | NSF Public Access Repository

Total Variation Distance Meets Probabilistic Inference

Bhattacharyya, Arnab; Gayen, Sutanu; Meel, Kuldeep; Myrisiotis, Dimitrios; Pavan, A; Vinodchandran, N V (July 2024, Proceedings of Machine Learning Research)

Full Text Available
On Approximating Total Variation Distance

https://doi.org/10.24963/ijcai.2023/387

Bhattacharyya, Arnab; Gayen, Sutanu; Meel, Kuldeep S.; Myrisiotis, Dimitrios; Pavan, A.; Vinodchandran, N. V. (August 2023, International Joint Conference on Artificial Intelligence)

Total variation distance (TV distance) is a fundamental notion of distance between probability distributions. In this work, we introduce and study the problem of computing the TV distance of two product distributions over the domain {0,1}^n. In particular, we establish the following results.
1. The problem of exactly computing the TV distance of two product distributions is #P-complete. This is in stark contrast with other distance measures such as KL, Chi-square, and Hellinger, which tensorize over the marginals, leading to efficient algorithms.
2. There is a fully polynomial-time deterministic approximation scheme (FPTAS) for computing the TV distance of two product distributions P and Q where Q is the uniform distribution. This result is extended to the case where Q has a constant number of distinct marginals.
In contrast, we show that when P and Q are Bayes net distributions, the relative approximation of their TV distance is NP-hard.
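The contrast drawn in result 1 can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names and the parameterization (each distribution given as a list of marginals Pr[x_i = 1]) are assumptions made for illustration. The TV distance is computed by enumerating all 2^n points, while the squared Hellinger distance is computed in O(n) time from the per-coordinate Bhattacharyya coefficients.

```python
from itertools import product
from math import prod, sqrt


def tv_product_bruteforce(p, q):
    """Exact TV distance between two product distributions on {0,1}^n.

    p[i] and q[i] are the marginals Pr[x_i = 1] (an assumed encoding).
    Enumerates all 2^n points, so it is only usable for small n.
    """
    n = len(p)
    total = 0.0
    for x in product((0, 1), repeat=n):
        px = prod(p[i] if x[i] else 1.0 - p[i] for i in range(n))
        qx = prod(q[i] if x[i] else 1.0 - q[i] for i in range(n))
        total += abs(px - qx)
    return total / 2.0


def hellinger_sq_product(p, q):
    """Squared Hellinger distance, H^2 = 1 - BC, in O(n) time.

    The Bhattacharyya coefficient BC of a product distribution is the
    product of the per-coordinate coefficients (tensorization).
    """
    bc = prod(
        sqrt(pi * qi) + sqrt((1.0 - pi) * (1.0 - qi))
        for pi, qi in zip(p, q)
    )
    return 1.0 - bc


if __name__ == "__main__":
    p = [0.2, 0.7, 0.5]
    q = [0.4, 0.6, 0.9]
    print("TV (2^n enumeration):", tv_product_bruteforce(p, q))
    print("H^2 (O(n) product):  ", hellinger_sq_product(p, q))
```

The brute-force routine is only a reference point for small n; it is not the FPTAS described in result 2.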
Full Text Available
Near-optimal learning of tree-structured distributions by Chow-Liu

https://doi.org/10.1145/3406325.3451066

Bhattacharyya, Arnab; Gayen, Sutanu; Price, Eric; Vinodchandran, N. V. (June 2021, STOC 2021: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing)
Full Text Available
Testing Product Distributions: A Closer Look

Bhattacharyya, Arnab; Gayen, Sutanu; Kandasamy, Saravanan; Vinodchandran, N. V. (March 2021, International Conference on Algorithmic Learning Theory)
We study the problems of identity and closeness testing of n-dimensional product distributions. Prior works of Canonne et al. (2017) and Daskalakis and Pan (2017) have established tight sample complexity bounds for non-tolerant testing over a binary alphabet: given two product distributions P and Q over a binary alphabet, distinguish between the cases P = Q and d_TV(P, Q) > epsilon. We build on this prior work to give a more comprehensive map of the complexity of testing of product distributions by investigating tolerant testing with respect to several natural distance measures and over an arbitrary alphabet. Our study gives a fine-grained understanding of how the sample complexity of tolerant testing varies with the distance measures for product distributions. In addition, we also extend one of our upper bounds on product distributions to bounded-degree Bayes nets.
Full Text Available
Efficient Distance Approximation for Structured High-Dimensional Distributions via Learning

Bhattacharyya, Arnab; Gayen, Sutanu; Meel, Kuldeep S; Vinodchandran, N. V. (December 2020, Annual Conference on Neural Information Processing Systems)
Larochelle, Hugo; Ranzato, Marc'Aurelio; Hadsell, Raia; Balcan, Maria-Florina; Lin, Hsuan-Tien (Ed.)
Full Text Available
Perfect Zero Knowledge: New Upperbounds and Relativized Separations

https://doi.org/10.1007/978-3-030-64375-1_24

Dixon, Peter; Gayen, Sutanu; Pavan, A.; Vinodchandran, N.V. (October 2020, Theory of Cryptography - 18th International Conference, TCC)
Pass, Rafael; Pietrzak, Krzysztof (Ed.)
We investigate the complexity of problems that admit perfect zero-knowledge interactive protocols and establish new unconditional upper bounds and oracle separation results. We establish our results by investigating certain distribution testing problems: computational problems over high-dimensional distributions represented by succinct Boolean circuits. A relatively less-investigated complexity class, SBP, emerged as significant in this study. The main results we establish are:
1. An unconditional inclusion that NIPZK is a subset of CoSBP.
2. Construction of a relativized world in which there is a distribution testing problem that lies in NIPZK but not in SBP, thus giving a relativized separation of NIPZK (and hence PZK) from SBP.
3. Construction of a relativized world in which there is a distribution testing problem that lies in PZK but not in CoSBP, thus giving a relativized separation of PZK from CoSBP.
Results (1) and (3) imply an oracle separating PZK from NIPZK. Our results refine the landscape of perfect zero-knowledge classes in relation to traditional complexity classes.
Full Text Available
Efficient Distance Approximation for Structured High-Dimensional Distribution via Learning

Bhattacharyya, Arnab; Gayen, Sutanu; Meel, Kuldeep S.; Vinodchandran, N. V. (December 2020, Advances in neural information processing systems)
Full Text Available
Learning and Sampling of Atomic Interventions from Observations

Bhattacharyya, Arnab; Gayen, Sutanu; Kandasamy, Saravanan; Maran, Ashwin; Vinodchandran, N. V. (July 2020, Proceedings of the 37th International Conference on Machine Learning)

We study the problem of efficiently estimating the effect of an intervention on a single variable using observational samples. Our goal is to give algorithms with polynomial time and sample complexity in a non-parametric setting. Tian and Pearl (AAAI ’02) have exactly characterized the class of causal graphs for which causal effects of atomic interventions can be identified from observational data. We make their result quantitative. Suppose 𝒫 is a causal model on a set V of n observable variables with respect to a given causal graph G, and let do(x) be an identifiable intervention on a variable X. We show that, assuming that G has bounded in-degree and bounded c-components (k) and that the observational distribution satisfies a strong positivity condition:
(i) [Evaluation] There is an algorithm that outputs with probability 2/3 an evaluator for a distribution P^ that satisfies TV(P(V | do(x)), P^(V)) < eps using m = O(n/eps^2) samples from P and O(mn) time. The evaluator can return in O(n) time the probability P^(v) for any assignment v to V.
(ii) [Sampling] There is an algorithm that outputs with probability 2/3 a sampler for a distribution P^ that satisfies TV(P(V | do(x)), P^(V)) < eps using m = O(n/eps^2) samples from P and O(mn) time. The sampler returns an iid sample from P^ with probability 1 in O(n) time.
We extend our techniques to estimate P(Y | do(x)) for a subset Y of variables of interest. We also show lower bounds for the sample complexity, demonstrating that our sample complexity has optimal dependence on the parameters n and eps and, if k=1, on the strong positivity parameter.
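To make the terms "evaluator" and "sampler" concrete, here is a minimal Python sketch of what such objects could look like for a distribution represented as a bounded in-degree Bayes net over binary variables. It is not the learning algorithm from the abstract: the toy network `NET`, its probabilities, and the function names `evaluate` and `sample` are all hypothetical, chosen only to show why both operations take time linear in the number of variables once the conditional probability tables are known.

```python
import random

# Hypothetical toy Bayes net on binary variables, in topological order.
# Each entry is (variable, parents, table), where table maps a tuple of
# parent values to Pr[variable = 1 | parents]. All numbers are made up.
NET = [
    ("A", (), {(): 0.3}),
    ("B", ("A",), {(0,): 0.2, (1,): 0.9}),
    ("C", ("A", "B"), {(0, 0): 0.1, (0, 1): 0.5, (1, 0): 0.4, (1, 1): 0.8}),
]


def evaluate(net, v):
    """Evaluator: probability of a full assignment v (dict var -> 0/1).

    One table lookup per variable, so O(n) for bounded in-degree.
    """
    prob = 1.0
    for var, parents, table in net:
        p1 = table[tuple(v[u] for u in parents)]
        prob *= p1 if v[var] == 1 else 1.0 - p1
    return prob


def sample(net, rng=random):
    """Sampler: draw one assignment by ancestral sampling.

    Variables are visited in topological order, so every parent value
    is already fixed when a variable is drawn.
    """
    v = {}
    for var, parents, table in net:
        p1 = table[tuple(v[u] for u in parents)]
        v[var] = 1 if rng.random() < p1 else 0
    return v


if __name__ == "__main__":
    print(evaluate(NET, {"A": 1, "B": 1, "C": 1}))
    print(sample(NET, random.Random(0)))
```

In the setting of the abstract, the tables would be estimated from observational samples and would describe the interventional distribution; here they are fixed by hand purely to show the two interfaces.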
Full Text Available
Learning and Sampling of Atomic Interventions from Observations

Bhattacharyya, Arnab; Gayen, Sutanu; Kandasamy, Saravanan; Maran, Ashwin; Vinodchandran, N. V. (July 2020, International Conference on Machine Learning)
Full Text Available
